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IN THE CLAIMS: 

1 . (Currently Amended) A method of detecting speech in an incoming signal comprising 
the steps of: 

receiving said incoming signal, extracting an estimate of the noise background of the 
incoming signal and suppressing the noise background of the incoming signal to provide a noise 
suppressed signal in which the estimated background noise has been removed, filtering the noise 
suppressed signal in which the background noise has been removed with a spectral inverse filter, said 
spectral inverse filter is determined by spectrum maxima and the inverse filtering operation 
comprising the steps of: 

in the logarithmic (dB) domain, removing the mean spectral magnitude from the original 
speech spectrum, 

in the mean removed short term frequency spectrum S(i), (7=1 . . .128), detennining all the 
frequency position (Pj), whose magnitudes are maxima over a window centered around Pj and 
stretching N positions to the left and right of Pj, 

in the list of peaks, adding the first (i=l) and last (i=128) frequency positions, their associated 
magnitudes set equal to the mean of the first and last M x N magnitudes, respectively, wherein said 
M and N are preset constants, 

removing the mean of the peak magnitudes from each peak magnitude, 

if the largest resulting peak magnitude exceeds a predetermined maximum peak value 
MAX_dB_DN, normalizing all peaks so that the largest peaks magnitude becomes MAX_Db_DN, 
and 

the resulting inverse filtering H(i), (i=l . . .128) is defined as the maximum of the normalized 
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peaks and 0 dB, and 

removing the inverse filter from the original spectrum in the logarithmic domain U(i)= S(i)- 
H(i) and measuring the periodicity of the signal from the inverse filter using an autocorrelation 
function to determine whether a signal frame corresponds correspond to a speech frame or not. 

2. (Original) The method of claim 1 wherein said periodicity measurement is defined as: 

Th 

p = max Rx(*) 

Ti 

where 7/ and Th are pre-specified so that the period will range in the range of speech and the signal is 
speech if p is above a given threshold. 

3. (Original) The method of Claim 2 wherein said period is between about 75 Hz and 400 

Hz. 

4. (Previously Presented) The method of claim 2 where said threshold value is set to 
maximize speech detection accuracy. 

5. (Original) The method of claim 1 wherein said extracting step includes the steps of: 
converting the spectrum of the incoming signal into logarithmic domain, 

removing high frequency components in logarithmic domain by recurrent filtering along the 
time axis, 

establishing an estimate of noise background, converting the estimate into linear domain, and 
suppressing the noise background from the signal, in linear domain. 
Claims 6-7 (Canceled) 
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8. (Currently Amended) The method of claim 10 [[7]] wherein said inverse filtering is 
performed in a log frequency domain and is implemented by subtracting from the original spectrum 
the estimated inverse filtering spectrum. 

9. (Canceled) 

10. (Currently Amended) A noise-resistant utterance detector comprising the steps of: 
accepting a speech utterance input signal, 

removing background noise from the utterance signal according to a spectral subtraction 
method to get a noise subtracted signal, 

inverse filtering the noise subtracted signal with a spectral inverse filter to get an inverse 
filtered signal, wherein said spectral inverse filter is based on a normalized approximation of an 
envelope of a short term speech spectrum derived from a local maxima of said short term speech 
spectrum, 

locating close low-frequency formants in the noise subtracted signal if they exist and 
inserting spectral valleys between said formants before said inverse filtering, 

calculating the autocorrelation from the inverse filtered signal to get an autocorrelation result, 

and 

detecting that a frame of the signal being processed is or is not speech based on a threshold 
applied to the autocorrelation result. 

11. (Currently Amended) The method of claim 10 wherein said spectral inverse filter is 
determined b y s pectr um maxima and the inverse filtering operation by the steps of: 

in the logarithmic (dB) domain, removing the mean spectral magnitude from the original 
speech spectrum, 
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in the mean removed short term frequency spectrum S(i), (7=1... 128), determining all the 
frequency position (Pj), whose magnitudes are maxima over a window centered around Pj and 
stretching N positions to the left and right of Pj, 

in the list of peaks, adding the first (i=l) and last (i=128) frequency positions, their associated 
magnitudes set equal to the mean of the first and last M x N magnitudes, respectively, wherein said 
M and N are preset constants, 

removing the mean of the peak magnitudes from each peak magnitude, and 

if the largest resulting peak magnitude exceeds a predetermined maximum peak value 
MAXdBDN, normalizing all peaks so that the largest peaks magnitude becomes MAX_dB_DN, 

wherein the resulting inverse filter filtering H(i), (i=l... 128) is defined as the maximum of 
the normalized peaks and 0 dB , and 

removing the inverse filter from the original spectrum in the logarithmic domain U(i) ~ S(i) 

if/;) 

12. (New) The method of claim 1 1 wherein said M, N and MAX_dB_DN are pre-selected 
to have the following values: M=5, N=3 and MAX_dB_DN=3.5 dB. 

1 3 . (New) The method of claim 1 wherein said M, N and MAX_dB_DN are pre-selected to 
have the following values: M=5, N=3 and MAX_dB_DN=3.5 dB. 

14. (New) The method of claim 1 1 further comprising removing said inverse filter from the 
original spectrum in the logarithmic domain U(i) = S(i)- H(i). 

15. (New) A method of determining if a signal includes speech, comprising: 
accepting an input signal; 

removing background noise from said input signal according to a spectral subtraction method 
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to obtain a noise subtracted signal; 

inverse filtering said noise subtracted signal with a spectral inverse filter to obtain an inverse 
filtered signal, wherein said inverse filtering is performed in a log frequency domain and is 
implemented by subtracting an estimated inverse filtering spectrum from an original spectrum of said 
input signal; 

calculating the autocorrelation from said inverse filtered signal to get an autocorrelation 
result; and 

detecting that a frame of said input signal is or is not speech based on a threshold applied to 
said autocorrelation result. 
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